OpenWPM: An automated platform for web privacy measurement

نویسندگان

  • Steven Englehardt
  • Chris Eubank
  • Peter Zimmerman
  • Dillon Reisman
  • Arvind Narayanan
چکیده

Web measurement techniques have been highly influential in online privacy debates and have brought transparency to the online tracking ecosystem. Due to its complexity, however, web privacy measurement remains a specialized research field. Our aim in this work is transform it into a widely available tool. First, we analyze over 30 web privacy measurement studies, identify several methodological challenges for the experimenter, and discuss how to address them. Next, we present the design and implementation of OpenWPM, a flexible, modular web privacy measurement platform that can handle any experiment that maps to a general framework. It supports parallelism for speed and scale, automatic recovery from failures of the underlying browser, and realistic simulation of users. OpenWPM is open-source and has already been used as the basis of several published studies on web privacy and security. We show how our generic platform provides a common foundation for these diverse experiments, including a new study on the “filter bubble” which we present here.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web Privacy Measurement : Scientific principles , engineering platform , and new results Draft – Jun 1 , 2014

The results of web privacy measurement have been very influential in online privacy debates. As a research field, however, web privacy measurement is immature and fragmented, and has not yet acquired an identity as a unified discipline. There are significant scientific and engineering challenges but the solutions tend to be ad-hoc. We identify 32 web privacy measurement studies, cast them as in...

متن کامل

Automating a Modified Personal Software Process

Personal Software Process (PSP) is a defined software development framework that includes defined operations, measurement and analysis techniques to assist software engineers to understand and build their own skills in order to improve their own personal performance. Even though several published studies have suggested that adopting PSP results in improved size and time estimation, and improved...

متن کامل

Token Attempt: The Misrepresentation of Website Privacy Policies through the Misuse of P3P Compact Policy Tokens (CMU-Cylab-10-014)

Platform for Privacy Preferences (P3P) compact policies (CPs) are a collection of three-character and four-character tokens that summarize a website’s privacy policy pertaining to cookies. User agents, including Microsoft’s Internet Explorer (IE) web browser, use CPs to evaluate websites’ data collection practices and allow, reject, or modify cookies based on sites’ privacy practices. CPs can p...

متن کامل

Towards Usable Privacy Policies: Semi-automatically Extracting Data Practices From Websites’ Privacy Policies

1. MOTIVATION Natural language privacy policies have become the de facto standard “notice and choice” method on the Web, in order to communicate a website's data practices. Yet, website privacy policies are often complex and difficult to understand. As a result, few users bother to read them [9]. It has been proposed to improve notice and choice mechanisms by making privacy practices machine-re...

متن کامل

Image flip CAPTCHA

The massive and automated access to Web resources through robots has made it essential for Web service providers to make some conclusion about whether the "user" is a human or a robot. A Human Interaction Proof (HIP) like Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) offers a way to make such a distinction. CAPTCHA is a reverse Turing test used by Web serv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016